Window Document (Text Processing)
Synopsis
Windows a document and returns a collection of the windows.
Description
This operator moves a sliding window over the tokens of a document and returns a collection containing a new document for each window. Both the size of the window and the step size, i.e. the number of tokens the window advances in each step, can be configured.
Input
- document
The document port.
Output
- documents (Collection)
The documents port.
Parameters
- window_length: Defines the number of tokens a window covers. Each resulting document will contain a token sequence of that length. Range:
- step_size: Defines the number of tokens between the starts of two consecutive windows. A step size of one causes each token to become the first token of a window. Range:
- extend_last_window: If checked, the last window will be extended so that it covers all remaining tokens. Otherwise, incomplete windows will be added. Range:
- parallelize_segment_processing: Determines whether the execution of Segment Processing should be parallelized. Range:
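
The interaction of window_length, step_size and extend_last_window can be illustrated with a small Python sketch. This is not the operator's implementation: the function name window_tokens is hypothetical, tokens are modeled as a plain list of strings, and the handling of leftover tokens is one possible reading of the extend_last_window description above (leftover tokens are either appended to the last complete window or returned as a single shorter window).

```python
def window_tokens(tokens, window_length, step_size, extend_last_window=False):
    """Illustrative sliding window over a token list (hypothetical helper).

    Returns a list of windows, each window being a list of tokens.
    """
    if window_length < 1 or step_size < 1:
        raise ValueError("window_length and step_size must be positive")

    n = len(tokens)
    bounds = []  # (start, end) index pairs of the windows
    start = 0

    # Collect every complete window of exactly window_length tokens.
    while start + window_length <= n:
        bounds.append((start, start + window_length))
        start += step_size

    # Index just past the last token covered by a complete window.
    covered = bounds[-1][1] if bounds else 0

    if covered < n:
        if extend_last_window and bounds:
            # Assumption: stretch the last complete window to the end.
            bounds[-1] = (bounds[-1][0], n)
        elif start < n:
            # Assumption: add one incomplete (shorter) window instead.
            bounds.append((start, n))

    return [tokens[s:e] for s, e in bounds]


tokens = "a b c d e f g h".split()

print(window_tokens(tokens, window_length=3, step_size=2))
# [['a', 'b', 'c'], ['c', 'd', 'e'], ['e', 'f', 'g'], ['g', 'h']]

print(window_tokens(tokens, window_length=3, step_size=2, extend_last_window=True))
# [['a', 'b', 'c'], ['c', 'd', 'e'], ['e', 'f', 'g', 'h']]
```

With a step size smaller than the window length, consecutive windows overlap; with a step size equal to the window length, the windows partition the token sequence.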